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TITLE OF THE INVENTION 
METHOD AND APPARATUS FOR REQUEST/GRANT PRIORITY SCHEDULING 

BACKGROUND OF THE INVENTION 

(1) Field of the Invention 

[0001] The invention relates to the problem of scheduling traffic in a switch/router. 
More particularly, the invention relates to the problem of reducing switch latency for high 
priority, or real-time, traffic in a switch/router that uses a request/grant mechanism for 
traffic scheduling. 

(2) Description of the Related Art 

[0002] Communication networks typically use devices for directing the flow of data 
through them. Such devices are often characterized as switches and routers, which may 
be referred to collectively as switches/routers. A switch/router often needs to process 
data of different priorities according to different criteria. However, switches/routers 
often exhibit deficiencies that prevent them from maintain conformance with some of the 
particular criteria that may be desired. 

[0003] Switch/routers commonly have a structure in which processing circuitry 
associated with ports to external lines is located on line cards, which may contain the 
ports and circuitry associated with one or more external lines. Among other functions, 
the circuitry determines the preferred destination of arriving packets. Packets are 
transmitted to the line card(s) associated with the preferred destination(s) by means of a 
switching fabric. To provide a high performance switch/router, it is necessary for the 
switch fabric to efficiently carry data between line cards with high data rates and low 
latency. In order to meet these requirements, an important class of switching fabric has 
emerged. It is characterized by a request/grant structure. Line cards request access to use 
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a path through the fabric to carry data to the desired destination line card. An arbitrator 
associated with the switch fabric processes the requests from all line cards to determine a 
way to grant access to line cards to optimize fabric utilization and fairness and other 
criteria. Once the decision is made, access is granted to line cards to send data destined 
for a particular destination through the switch fabric at a particular time. The switch 
fabric is configured so that, at that particular time, data sent from selected inputs is 
transferred to selected outputs corresponding to the grants given. The latency between 
data being presented to the fabric and it arriving at the destination line cards is 
deterministic. Thus, no reordering of the data occurs within the switch fabric. Such a 
class of switch fabric presents an excellent example of a context to which at least one 
embodiment of the present invention may be beneficially applied. 

[0004] Figure 1 is a timing diagram illustrating a prior art technique for scheduling 
traffic in a communications node. Events occur at varying relative times along time 
access 101. Events are categorized as occurring during a request phase 102, the grant 
phase 103, and a data phase 104. During request phase 102, a line card generates 
requests and communicates them to a switch fabric. In the illustrated example, a line 
card first generates and communicates low priority (LP) request 105 then, at time 113, 
generates and communicates high priority (HP) requests 106. 

[0005] At grant phase 103, the switch fabric grants opportunities for the line card to 
communicate the indicated data through the switch fabric. In the illustrated example, the 
switch fabric issues grant 107 corresponding to the low priority request 105 and high 
priority grant 108, which corresponds to high priority request 106 as shown by 
connection 111. However, in the illustrated example, high priority grant 108 occurs at 
time 1 14, which follows time 1 13 by a duration 1 15. Since certain latency requirements 
maybe placed upon high priority data, a limit may be placed on duration 1 15 between 
issuance of high priority request 106 at time 113 and issuance of high priority grant 108 
at time 1 14. Under some circumstances, if high priority grant 108 occurs well after the 
issuance of high priority request 106, duration 15 may exceed such latency limits. 
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[0006] At data phase 104, the line card communicates the data associated with the 
granted requests from the line card to the switch fabric. In the illustrated example, data 
communication corresponding to the low priority request and grant occurs slightly after 
the issuance of low priority grant 107, and data communication 1 10 corresponding to the 
high priority request and grant occurs shortly after the issuance of grant 108, as illustrated 
by connection 112. 

[0007] Figure 1 shows an example prior art request/grant chronological sequence of 
low priority (LP) and high priority (HP) requests, grants and their associated data 
transmissions (e.g., an ATM cell). 

[0008] Referring to Figure 1, the flow of respective LP and HP request/grant/data 
occurrences are sequential in nature. Each request includes a priority indication, either 
high or low, an identifier, and a destination switch fabric output port indication (not 
shown). The request originates from a line card in a switch/router and is sent to a switch 
fabric of the switch/router for consideration. If the fabric can meet the request, typically 
for the transmission of one cell into the fabric, then it replies accordingly with a grant. 
The grant includes an indication of the same priority and identifier as the request. The 
line card receives the grant after some delay and then transmits the data to the switch 
fabric. For example, at time A a line card sends a HP request to the switch fabric, at time 
B the line card receives a grant in reply, and at time C the line card transmits the data to 
the fabric. Likewise, a similar flow occurs for an LP request sent prior to time A, with 
corresponding grant and data occurrences happening at times prior to times B and C, 
respectively. 

[0009] Referring to Figure 1, the time delay between a request and a grant received in 
reply can be greater than an allowable maximum delay limit for HP traffic, especially if 
the switch/router starts to become overloaded. Where the HP traffic carries real-time 
services this delay could result in an unacceptable degradation in service quality, and 
therefore would be desirable to avoid. Thus, a technique is needed to ensure that 
guaranteed bandwidth commitments are met while maximizing utilization of 
switching/routing capacity. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0010] The present invention may be better understood, and its features made 
apparent to those skilled in the art by referencing the accompanying drawings. 

[0011] Figure 1 is a timing diagram illustrating a prior art technique for scheduling 
traffic in a communications node. 

[0012] Figure 2 is a timing diagram illustrating a method for scheduling traffic in a 
communications node in accordance with at least one embodiment of the present 
invention. 

[0013] Figure 3 is a block diagram illustrating an apparatus for scheduling traffic in 
accordance with at least one embodiment of the present invention. 

[0014] Figure 4 is a flow diagram illustrating a method for scheduling traffic in a 
communications node having a plurality of line cards coupled to a switch fabric in 
accordance with at least one embodiment of the present invention. 

[0015] The use of the same reference symbols in different drawings indicates similar 
or identical items. 
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DETAILED DESCRIPTION OF THE INVENTION 

[0016] In accordance with at least one embodiment of the present invention, a 
method and apparatus for scheduling traffic in a communications node is provided. Line 
cards request communication opportunities from a switch fabric. The switch fabric issues 
grants for such communication opportunities in response to specific requests. By 
dynamically adjusting usage of such communication opportunities corresponding to such 
grants among requests of differing priorities and/or latency criteria, embodiments of the 
present invention are able to provide increased capacity utilization of switching fabric 
bandwidth while maximizing adherence to priority requirements and/or latency criteria. 

[0017] Figure 2 is a timing diagram illustrating a method for scheduling traffic in a 
communications node in accordance with at least one embodiment of the present 
invention. Steps in the method occur at various times along time access 201. Steps may 
be characterized as occurring at request phase 202, grant phase 203, and data phase 204. 

[0018] At request phase 202, a line card, such as line card 301 of Figure 3 generates 
and communicates requests for data communication to a switch fabric, such as switch 
fabric 304 of Figure 3. In the illustrated example, the line card first generates and 
communicates low priority request 205, then, at time 213, generates high priority request 
206. 

[0019] At grant phase 203, the switch fabric generates and communicates a grant of 
an opportunity for the line card to use the switch fabric for data communication. In the 
illustrated example, the switch fabric generates low priority grant 207 at time 214 and 
communicates it back to the line card. Then, the switch fabric generates high priority 
grant 208 and transmits it back to the line card. 

[0020] However, the line card maintains the latency requirements of the high priority 
data by utilizing low priority grant 207 as a high priority grant 216 and utilizing high 
priority grant 208 as a low priority grant 217. Thus, the duration 215 between time 213 
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at which the high priority request 206 is made and time 214 at which high priority grant 
216 occurs is maintained at less than the applicable time limit. . 

[0021] At data phase 204, the line card communicates the data corresponding to the 
grants through the switch fabric. In the illustrated example, the line card communicates 
high priority data 209 shortly after high priority grant 216, as illustrated by connection 
212, and communicates low priority data 210 shortly after low priority grant 217. 

[0022] A HP request 206 can take a grant 207 for an LP request 205 if the requests 
are for the same switch fabric output port. Afterward, if a HP grant 208 for that port is 
sent in reply of the HP request 206 then the HP grant 208 can be used by the LP request 
205. For example, an HP request 206 is sent at time 213 and an LP grant 207, received at 
time 215, is used for transmission of HP data 209 to the fabric. In this way, the time 
delay 215 between an HP request 206 and grant 216 is reduced such that it remains 
within the maximum delay limit for HP traffic. 

[0023] The line card can make the grant substitution (e.g., using low priority grant 
207 as high priority grant 216 and using high priority grant 208 as low priority grant 217) 
in a number of ways. For example, the line card can keep track of which requests it has 
pending with the switch matrix and there respective priorities. As grants are received, the 
line card can pass high priority traffic in response to the first available grants and then, 
after the high priority traffic has been passed, pass low priority traffic in response to 
subsequent grants. As another example, the line card can keep track of latency criteria, 
such as any limits imposed on duration 215, and select among traffic to ensure that 
latency criteria are met, or, if not all latency criteria can be met, select among traffic so as 
to either maximize the number of latency criteria that are met or ensure that latency 
criteria designated as most important are met. Also, the two previous examples may be 
combined so that higher priority pending traffic can be passed before lower priority 
pending traffic, and the pending traffic can be passed so as to maximize opportunities to 
satisfy latency criteria. 
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[0024] Figure 3 is a block diagram illustrating an apparatus for scheduling traffic in 
accordance with at least one embodiment of the present invention. A communication 
node 329 may be implemented as a switch/router comprising a plurality (N) of line cards 
301, 302, and 303 coupled to the switch fabric 304. Line card 301 is coupled to switch 
fabric 304 via connection 305. Line card 302 is coupled to switch fabric 304 via 
connection 306. Line card 303 is coupled to switch fabric 304 via connection 307. 
Additional line cards may be coupled to switch fabric 304 via connections 308. Switch 
fabric 304 provides output ports 309. 

[0025] Each line card comprises a plurality (M) of virtual output queues (VOQ) 310, 
311, 312, with each of VOQ 310, 311, and 312 corresponding to a respective output port 
309 of the switch fabric. VOQ 310 provides an output 313, which is coupled to 
connection 305. VOQ 311 provides an output 314, which is coupled to connection 305. 
VOQ 312 provides an output 315, which is coupled to connection 305. 

[0026] Each output queue 310, 311, 312 is preferably configured to service a queue 
structure 330, which is depicted as an inset diagram to provide illustration in greater 
detail. Queue structure 330 comprises a plurality of input queues (Q) 316, 317, 318 and a 
hierarchical arrangement of schedulers (S) 319, 320, 321 in order to support multiple 
classes of traffic, each class having a unique priority level. Input queue 316 is coupled to 
scheduler 319 via connection 322. Input queue 317 is coupled to scheduler 319 via 
connection 323. Additional input queues may also be coupled to scheduler 319. Input 
queue 318 is coupled to scheduler 320 via connection 324. Additional input queues may 
also be coupled to scheduler 320 via connections 325. Scheduler 319 is coupled to 
scheduler 321 via connection 326. Scheduler 320 is coupled to scheduler 321 via 
connection 327. Additional schedulers may be coupled to scheduler 321 via connections 
328. 

[0027] To support multiple traffic priority levels the switch/router architecture of 
Figure 3, at least one embodiment of the invention provides a means pairing higher 
priority requests with lower priority grants of like VOQs (i.e. like fabric output port 
destinations) over a range of traffic priority levels. Basically, a request of a particular 
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priority level and VOQ can be paired with a grant of any lower priority level and same 
VOQ. 

[0028] VOQs may also correspond to a particular class of traffic associated with a 
respective output port, that is, there may be more than one VOQ associated with a 
respective output port. In this case, VOQs of different priorities with like fabric output 
port destinations may be grouped and grants transferred between the VOQs. 

[0029] Furthermore the basis of at least one embodiment of the invention is that the 
priority of the data sent in response to a grant does not need to correspond to the priority 
of that grant, and hence the request. As a result, there is no need for the priority of the 
requests to match the priority of the data present and the priority can be determined by 
other means. Through this, at least one embodiment of the invention can be used to 
ensure that guaranteed bandwidth commitments are met while maximizing utilization of 
switching/routing capacity. For example, where X Mbps of bandwidth are guaranteed by 
the switch/router (e.g. via connection admission control -CAC), each line card sends HP 
requests for only its portion of the X Mbps of guaranteed bandwidth, while LP requests 
are sent for all other traffic. Since LP grants can be taken by HP requests, by the same 
VOQ, the ability of the switch/router to meet bandwidth guarantees is not affected by the 
LP traffic. 

[0030] Figure 4 is a flow diagram illustrating a method for scheduling traffic in a 
communications node having a plurality of line cards coupled to a switch fabric in 
accordance with at least one embodiment of the present invention. The method begins in 
step 401 by sending, to the switch fabric by a first line card of the line cards, a first 
request for transmitting a first unit of the traffic of a first priority to the switch fabric. 
The first request indicates a first output port for which the first unit of the traffic is 
destined. The method continues in step 402 by using, by the first line card, a first grant 
received from the switch fabric permitting transmission of the first unit of the traffic to 
the switch fabric and issued in response to a second request made for a second unit of the 
traffic having a second priority lower than the first priority and being destined to the first 
output port, for scheduling transmission of the first unit of the traffic to the switch fabric. 
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[0031] The method continues in step 403 by using a second grant issued in response 
to the first request for scheduling transmission of the second unit of the traffic to the 
switch fabric. The method may optionally be practiced such that the first priority and the 
second priority are selected from a plurality of priorities corresponding to a respective 
plurality of service classes. The method may also optionally be practiced such that the 
first line card sends the first request after the second request. The method may also 
optionally be practiced such that the first line card sends a first set of requests of a highest 
priority of a plurality of priorities, with the first set of requests corresponding to a first 
quantity of the traffic in an amount of guaranteed traffic flow serviced by the first line 
card, and sends a second set of requests of a lower priority of the priorities for a second 
quantity of the traffic. 

[0032] At least one embodiment of the present invention is useful and beneficial in 
that it offers reduced latency for high priority, real-time, traffic in switches/routers having 
a fixed latency switching fabric. At least one embodiment of the present invention is 
useful and beneficial in that it enables increased capacity utilization for low priority 
traffic while maintaining latency guarantees for high priority traffic. 

[0033] To increase the cost effectiveness of a switch/router, it is desirable to increase 
its capacity utilization. However, QoS guarantees should be maintained while increasing 
capacity utilization. Among these, ensuring that switch latency for high priority, real- 
time, traffic is kept within specified limits is essential. At least one embodiment of the 
present invention provides this capability while enabling additional capacity to be used 
for lower priority traffic. Therefore, at least one embodiment of the present invention 
increases the cost-effectiveness and utility of switching/routing platforms. 

[0034] Thus, a method and apparatus for request/grant priority scheduling has been 
presented. Although the invention has been described using certain specific examples, it 
will be apparent to those skilled in the art that the invention is not limited to these few 
examples. Other embodiments utilizing the inventive features of the invention will be 
apparent to those skilled in the art, and are encompassed herein. 
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